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Abstract 

Multiple transmit antennas in a downlink channel can provide tremendous capacity (i.e. multiplexing) 
gains, even when receivers have only single antennas. However, receiver and transmitter channel state 
information is generally required. In this paper, a system where each receiver has perfect channel 
knowledge, but the transmitter only receives quantized information regarding the channel instantiation 
is analyzed. The well known zero forcing transmission technique is considered, and simple expressions 
for the throughput degradation due to finite rate feedback are derived. A key finding is that the feedback 
rate per mobile must be increased linearly with the SNR (in dB) in order to achieve the full multiplexing 
gain, which is in sharp contrast to point-to-point MIMO systems in which it is not necessary to increase 
the feedback rate as a function of the SNR. 



I. Introduction 

In multiple antenna broadcast (downlink) channels, capacity can be tremendously increased by adding 
antennas at only the access point [1] [2]. In essence, an access point (AP) equipped with M antennas can 
support downlink rates up to a factor of M times larger than a single antenna access point, even when 
each mobile device has only a single antenna'. In order to realize these benefits, however, the access 
point must: 

'in fact, this is true on the uplink as well, by the multiple-access/broadcast channel duality [2] 
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• Simultaneously transmit to multiple users over the same bandwidth (orthogonal schemes such as 
TDMA or CDMA are generally highly sub-optimal). 

• Obtain accurate channel state information (CSI). 

Practical transmission structures that allow for simultaneous transmission to multiple mobiles, such as 
downlink beamforming, do exist. The requirement that the AP have accurate CSI, however, is far more 
difficult to meet, particularly in frequency division duplexed (FDD) systems. Training can be used to 
obtain chaimel knowledge at each of the mobile devices, but obtaining CSI at the AP generally requires 
feedback from each mobile. Such feedback channels do exist in current systems (e.g., for power control), 
but the required rate of feedback is clearly an important quantity for system designers. 

In this paper, we consider the practically motivated finite rate feedback model, in which each mobile 
feeds back a finite number of bits regarding its channel instantiation at the beginning of each block 
or frame. This model was first considered for point-to-point MIMO channels in [3] [4] [5], where the 
transmitter uses such feedback to more accurately direct its transmitted energy towards the receiver's 
anteima array, and even a small number of bits per anteima can be quite beneficial [6]. In point-to-point 
MIMO chaimels, the level of CSI available at the transmitter only affects the SNR-offset; it does not affect 
the slope of the capacity vs. SNR curve, i.e., the multiplexing gain. However, the level of CSI available to 
the transmitter does affect the multiplexing gain of the MIMO downUnk channel [1]. As a result, channel 
feedback is considerably more important for MIMO downlink channels than for point-to-point channels. 

In contrast to most recent work on the MIMO downlink channel which has primarily concentrated on 
channels with a very large number of mobiles [7] [8] [9], we consider systems in which the number of 
mobiles is equal to the number of transmit anteimas. This regime is apphcable for inherently smaller 
systems, as well as large systems in which stringent delay constraints do not allow a small subset of 
users to be selected for transmission. Random beamforming is an alternative Umited feedback strategy for 
MIMO downlink chaimels in which each mobile feeds back a very low rate quantization of the channel 
(log2 M bits) as well as an analog SNR value [7]. While this strategy performs well when there are a 
large number of mobiles relative to the number of transmit anteimas, it performs poorly in the small 
system regime which we consider. 

In this work we propose a simple downlink transmission scheme that uses zero-forcing precoding in 
conjunction with finite rate feedback. We consider systems in which each mobile performs vector quan- 
tization on its channel realization using random quantization codebooks, i.e. random vector quantization 
[10] [11]. Our key findings are: 

• The throughput of a feedback-based zero-forcing system is bounded if the SNR is taken to infinity 
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and the number of feedback bits per mobile is kept fixed. 

• The number of feedback bits per mobile (B) must be increased linearly with the SNR (in dB) at 
the rate 

^^^B 

in order to achieve the full multiplexing gain of M. In addition, this scaling of B guarantees that the 
throughput loss relative to perfect CSIT-based zero-forcing is upper bounded by M bps/Hz, which 
corresponds to a 3 dB power offset. 

• Scaling the number of feedback bits according to i? = a log2 P for any a < M — I results in a 
strictly inferior multiplexing gain of M 

In essence, the channel estimation error at the access point must scale as the inverse of the SNR in order 
to allow the full multiplexing gain to be achieved, which results in the required linear scaling of feedback. 
As a result of this scaling, feedback requirements are considerably higher in downUnk channels than in 
point-to-point MIMO channels, which will have considerable design implications. 

The MIMO downlink finite-rate feedback model was also considered independently by Ding, Love, 
and Zoltowski [12] 

The remainder of this paper is organized as follows. In Section ^ we describe the channel model and 
the finite rate feedback mechanism. In Section |ffl] we provide background material on MIMO downlink 
capacity, downlink precoding, and random vector quantization. In Section |^ we described the proposed 
zero-forcing based system, and analyze the throughput of this system in Section |3 We provide numerical 
results comparing finite -rate feedback systems to alternative transmission techniques in Section El and 
close by discussing conclusions and possible extensions of this work in Section IVIII 

II. System Model 

We consider a K receiver multiple antenna broadcast channel in which the transmitter (access point, or 
AP) has M > 1 antennas and each receiver has a single antenna. The broadcast channel is mathematically 
described as: 

yi = htx + rei, i = l,...,K (1) 

where hi,h2,...,hx are the channel vectors (with hj G C^^^^) of users 1 through K, the vector 
X G C^^^^ is the transmitted signal, and ni, . . . , hk are independent complex Gaussian noise terms with 
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Fig. 1. Finite Rate Feedback System Model 



unit variance. The input must satisfy a transmit power constraint of P, i.e. i?[||x|p] < P. We denote 
the concatenation of the channels by = [hi h2 • • • h^^], i.e. H is -fC x M with the i-th row equal 
to the channel of the i-th receiver (h|). In order to focus our efforts on the impact of imperfect CSI, 
we consider a system where the number of mobiles is equal to the number of transmit antennas, i.e., 
K = M. However, this work can be combined with so-called user selection algorithms for systems with 
more mobiles than antennas (K > M), as studied in [13]. 

The channel is assumed to be block fading, with independent fading from block to block. The 
entries of the channel matrices are distributed as iid unit variance complex Gaussians (Rayleigh fading). 
Furthermore, each of the receivers is assumed to have perfect and instantaneous knowledge of its own 
channel vector, i.e. hj. Notice it is not required for mobiles to know the channel of other mobiles. Partial 
CSI is acquired at the transmitter via a finite rate feedback channel from each of the mobiles, as described 
below. 

In the finite rate feedback model shown in Fig. ^ each receiver quantizes its channel (with hj assumed 
to be known perfectly at the i-th receiver) to B bits and feeds back the bits perfectly and instantaneously to 
the access point, which is assumed to have no other knowledge of the instantaneous state of the channel. 
The quantization is performed using a vector quantization codebook that is known at the transmitter 
and the receivers. A quantization codebook C consists of 2^ M-dimensional unit norm vectors C = 
{wi, . . . , W2b}, where B is the number of feedback bits per mobile. Similar to point-to-point MIMO 
systems, each receiver quantizes its channel to the quantization vector that is closest to its channel vector, 
where closeness is measured in terms of the angle between two vectors or equivalently the inner product 
[4] [5]. Thus, user i computes quantization index Fi according to: 

-Fj = arg max |htw,| = arg min sin^ (Z(hj, w,)) . (2) 
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and feeds this index back to the transmitter. Note that only the direction of the channel vector is quantized, 
and no information regarding the channel magnitude is conveyed to the transmitter. Magnitude information 
can be used to perform power and rate loading across multiple channels, but this generally of secondary 
concern when the number of mobiles is the same as the number of antennas. If there are more users 
than antennas (i.e. K > M), however, channel magnitude information can be used to assist with the user 
selection process [13]. 

Clearly, the choice of vector quantization codebook significantly affects the quality of the CSI provided 
to the access point. In this work, we analyze performance using random vector quantization (RVQ), in 
which an ensemble of random quantization codebooks is considered. Details of RVQ are discussed in 
Section UlTCl 

Notation: We use boldface to denote vectors and matrices and A^^ refers to the conjugate transpose, 
or Hermitian, of A. The notation ||x|| refers the Euclidean norm of the vector x, and Z(x, y) refers to 
the angle between vectors x and y with the standard convention | cos(Z(x,y))| = |xty|/(||x|| • ||y||). 

III. Background 

A. Capacity Results for MIMO Broadcast Channels 

In this section we summarize relevant capacity results for the multiple-antenna broadcast channel. 
When perfect CSI is available at transmitter and receivers, the capacity region of the channel is achieved 
by dirty-paper coding [14] [1] [15] [16] [17], which is a technique that can be used to pre-cancel multi- 
user interference at the transmitter [18]. In this paper we study the total system throughput, or the sum 
rate, which we denote as Csum.{)^,P)- At high SNR, the sum rate capacity of the MIMO BC can be 
approximated as [19]: 

CTx"/i?,x-cs/(H, P)^M log (P) + c (3) 

where c is a constant depending on the channel realization H. The key feature to notice is that capacity 
grows linearly as a function of M. Though the K (total) receive antennas are distributed amongst K 
receivers, the linear growth is the same as in a M-transmit, i^-receive antenna point-to-point MIMO 
system, i.e. both systems have the same multiplexing gain. 

If each of the mobiles suffers from fading according to the same distribution and the transmitter has 
no instantaneous CSI, the situation is very different. In this scenario, the channels of all receivers are 
statistically identical, and thus the channel is degraded, in any order. Therefore, any codeword receiver 1 
can decode can also be decoded by any other receiver, which implies that a TDMA strategy is optimal 
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[20, Section VI] [21]. Thus, the sum capacity of this channel is equal to the capacity of the point-to-point 
channel from the transmitter to any individual receiver: 



and therefore the multiplexing gain of this channel is only one. In fact, the downlink channel achieves a 
multiplexing gain of only one for any fading distribution (i.e., for any distribution on the magnitude of the 
chaimel) in which the spatial direction of each chaimel is isotropically distributed [21]. This includes a 
downlink chaimel in which users have unequal average SNR but each suffers from spatially uncorrelated 
Rayleigh fading. 

There clearly is a huge gap between the capacity of the MIMO downlink channel with transmitter CSI 
(multiplexing gain of M) and without transmitter CSI (multiplexing gain of 1). Thus, it is of interest to 
investigate the more practical assumption of partial CSI at the AP. If each of the mobiles has perfect 
CSI and the AP has imperfect CSI of fixed quality, (e.g., Rician fading with a fixed variance that is 
independent of the SNR), it has recently been shown that the multiplexing gain of the sum capacity 
is strictly smaller than M [22]^. Somewhat complementary to this result, our work shows that the full 
multiplexing gain of M can be achieved if the feedback rate (i.e., the quality of the CSI) is appropriately 
increased as a function of SNR. 

B. Downlink Precoding 

Though dirty paper coding is capacity achieving for the MIMO broadcast channel, the technique 
requires considerable complexity and practical implementations are still being actively pursued [23]-[25]. 
As a result, simpler downhnk transmission schemes are of obvious interest. One such scheme is downlink 
beamforming^, which incurs a rate/power loss relative to DPC but achieves the same multiplexing gain of 
M. In order to implement this scheme, the transmitter multiplies the symbol intended for each receiver 
by a beamforming vector and transmits the sum of these vector signals. Let Sj denote the scalar symbol 
intended for the i-th receiver, and let denote the corresponding unit norm beamforming vector. The 
transmitted signal is then given by: 




(4) 



K 




(5) 



^It is conjectured that the multiplexing gain in this scenario is in fact equal to one, although this has yet to be shown. 
^Downhnk beamfonning is also referred to as linear precoding or space-division multiple access (SDMA). 
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The received signal at user i is therefore given by: 

K 

yi = hjx + = ^ hJvjSj + n^, (6) 

and the SINR at mobile i is: 

SINRi = ^' \ ^ , , (7) 

under the assumption that each of the symbols has power P/M, and that no interference cancellation is 
performed at the mobiles. Note that the capacity-achieving strategy is similar to downlink beamforming 
with the addition of a pre-coding step at the transmitter which leads to the eUmination of some of the 
multi-user interference terms in the SINR expression. 

The performance of downlink beamforming clearly depends on the choice of beamforming vectors, but 
the problem of determining the sum rate maximizing beamforming vectors is generally very difficult. One 
simple choice of beamforming vectors are the zero-forcing vectors, which are chosen such that no multi- 
user interference is experienced at any of the receivers. This can be done by choosing the beamforming 
vector of user i orthogonal to the chaimel vectors of all other users, i.e., by choosing orthogonal to 
hj for all z 7^ j. It is easily seen that the zero-forcing beamforming vectors are simply the normahzed 
columns of the inverse of the concatenated chaimel matrix H. If such beamforming vectors are used, the 
received signal at the i-th mobile reduces to: 

K 

yi = htx + ni = ^ hI'vjSj + rii = htvjSi + rii (8) 

i=i 

because hjvj = for all i 7^ i by construction. Since all interference has been eliminated, the 
corresponding SNR is given as SNRi = -^|htvip. In fact, zero-forcing is optimal amongst all downlink 
beamforming strategies at asymptotically optimal at high SNR [19]. 

Since zero-forcing creates M independent and parallel chaimels, the resulting multiplexing gain is 
equal to M, which is the same as for the capacity-achieving DPC strategy. Zero forcing does however, 
incur a rate loss (or alternatively, a power loss) relative to capacity. At high SNR, the power loss of zero 
forcing relative to DPC converges to ^^^-^ I^jll^ i^rj dB, which is approximately equal to 31og2 M 
dB [19]. Clearly, the transmitter must have perfect channel knowledge in order to choose the zero-forcing 
beamforming vectors. If there is any imperfection in this knowledge, there inevitably will be some multi- 
user interference, which leads to performance degradation. 
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C. Random Vector Quantization 

In this work we use random vector quantization (RVQ), in which each of the 2^ quantization vectors 
is independently chosen from the isotropic distribution on the M-dimensional unit sphere. We analyze 
performance averaged over all such choices of random codebooks, in addition to averaging over the 
fading distribution. Random codebooks are used because the optimal vector quantizer for this problem is 
not known in general, and known bounds are rather loose. RVQ, on the other hand, is very amenable to 
analysis and also performs measurably close to optimal quantization, as is shown in Section IV-DI Note 
that each receiver is assumed to use a different and independently generated quantization codebook; if a 
common codebook was used, there would be a non-zero probability that multiple users return the same 
quantization vector, which complicates transmission. 

Random vector quantization was first used to analyze the performance of CDMA and point-to-point 
MIMO channels with finite rate feedback, and has been shown to be asymptotically optimal in the large 
system limit (e.g., infinitely many antennas) [10] [11]. There has also been very recent work characterizing 
the error performance of point-to-point MISO (multiple-input, single-output) systems utilizing RVQ [26]. 

We now review some basic results on RVQ from [26] that will be useful in later derivations. As stated 
earlier, the quantization vectors are iid isotropic vectors on the M-dimensional unit sphere, as are the 
channel directions hj = due to the assumption of iid Rayleigh fading. The most important quantity 
of interest is the statistical distribution of the quantization error. In order to determine this, first consider 
the inner product between a channel vector and a quantization vector: 

2 



Iwlh,;|2 



hj|| cos (Z(hi,w 



Because hj and Wj are independent isotropic vectors, the quantity |wjhjp = cos^ ^Z(hj,Wj)) is beta 



distributed with parameters 1 and M — 1, and sin ^Z(hj, Wj) j = 1 — cos (Z(hj, Wj)) is beta distributed 
with parameters M — 1 and 1. Thus the CDF oi X = s\v? ^Z(hj, Wj)^ is given by Pr(X < x) = x^^~^. 

Let hj denote the quantization of the vector hj, i.e. the solution to Since the quantization vectors 
are independent, the quantization error Z = sin^ ^Z(hj,hj)^ is the minimum of 2^ independent beta 
(M - 1, 1) random variables, and the CCDF of Z is given by Pr(Z > z) = {I - z^^-'^f [26, Lemma 
1]. The expectation of this quantity has been computed in closed form [26]: 



sin^(Z(h„h,,)) =2«./?(2^,^). (9) 



Here we use /?(•) to denote the beta function, which is defined in terms of the gamma function as 

r(^)r(y) 
r(x+y) 



[3{x,y) = ^^^1+^) 

[27]. The gamma function is the extension of the factorial function to non-integers. 
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and satisfies tlie fundamental properties 7(72) = (n — 1)! for positive integers and T{x + 1) = xr(x) for 
all X [27]. While the derivation of the expectation in [26] depends on the Pochmann symbol, this result 
can alternatively be derived using an integral representation of the beta function, as shown in Appendix 
U Furthermore, a simple extension of inequalities given in [26] gives a strict upper bound to the expected 
quantization error: 

Lemma 1: The expected quantization error can be upper bounded as: 



-E'h,w 

Proof: See Appendix Ull 



sin^ ( Z(hj,hj 



<2 ~ 



D. Point-to-Point MISO Systems with Finite Rate Feedback 

In this section we briefly review some basic results on point-to-point MISO systems (i.e., M transmit 
antennas, single receive antenna, which is equivalent to the given system model with K = I) with finite 
rate feedback, under the assumption of iid Rayleigh fading. If the transmitter has perfect CSI, it is well 
known that the optimum transmission strategy is to beamform along the channel vector h [28] and the 
corresponding (ergodic) capacity is Ccsit(^) = Ey^ [log (l + P||h|p)]. If the transmitter has no CSIT 
and only has knowledge of the fading distribution, the optimum transmission strategy is to transmit 
independent and equal power signals from each of the M transmit antennas, and the corresponding 
capacity is Cno-csixli^) = [log (l + ^||h|p)]. Clearly Cno-csn{P) = Ccsn{^), which corresponds 
to a 10 logio ^'^ dB shift in the capacity curve. Thus the lack of CSIT leads to a 10 log^Q M dB SNR 
loss relative to perfect CSIT. 

Providing the transmitter with partial CSIT via a finite rate feedback channel can be used to reduce this 
SNR loss. If the transmitter acquires CSIT through the finite rate feedback channel, an optimal or nearly 
optimal strategy is to beamform in the direction of the quantization vector'^. The average rate achieved 
with this strategy assuming RVQ is used is given by: 

Rfb{P) = ^h,w [log(l + P||h||2cos2(^Z(h,h); 

« [log(l + P||h||2(l-2^^))] , 

where we have based the approximation on the upper bound given in Lemma ^ which is numerically very 
accurate. Thus, the use of limited feedback leads to an SNR degradation of approximately 101ogxo(l — 

''Conditions for the optimality of beamforming along the quantization direction are provided in [29]. Though these conditions 
are difficult to analytically compute for most quantization codebooks, it is generally well accepted that beamforming performs 
extremely close to capacity. 
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Fig. 2. 4x1 MISO System with CSIT, No CSIT, & Feedback 

B 

2 '^'-i ) dB relative to perfect CSIT. Note that this approximation agrees with the expression derived for 
an asymptotically large number of transmit antennas in [11]. If i? = M — 1 for example, a finite rate 
feedback system is expected to perform within about 3 dB of a perfect CSIT system. The capacity with 
CSIT, no CSIT, and finite rate feedback with B = M - 1 = 3 is shown for a 4 x 1 MISO system in Fig. 
121 Notice that there is a 6 dB gap between the CSIT and no CSIT curves, while the finite rate feedback 
curve is 2.7 dB from the perfect CSIT curve, which is quite close to our approximation of 3 dB. 

The key point to take from this discussion is that the feedback load need not be increased as a function 
of SNR in order to maintain a constant power or rate gap relative to the perfect CSIT capacity curve. 
This is perhaps not surprising, since the amount of feedback only affects the cos^ ^Z(h, h)^ term and 
thus leads to only a SNR degradation. Furthermore, also note that the multiplexing gain (i.e., the slope of 
the capacity curve) is not affected by the level of CSIT. For the MIMO downlink channel, however, the 
multiplexing gain of a M-transmit antenna, K user system is min(A^, K) when there is perfect CSIT, but 
is only 1 if there is no CSIT. Clearly such a channel will be considerably more sensitive to the accuracy 
of the CSIT obtained through the finite rate feedback channels. 

IV. Downlink Precoding with Finite Rate Feedback 

After the transmitter has received feedback bits from each of the K receivers, an appropriate multi-user 
transmission strategy must be chosen. We propose using zero-forcing beamforming based on the channel 



11 



quantizations available at the transmitter. Zero-forcing (ZF) is a low complexity transmission scheme 
that can be implemented by a simple linear precoder, and its performance is optimal amongst the set 
of all linear precoders at asymptotically high SNR [19]. We later provide numerical results describing 
the performance of regularized zero-forcing precoding [30], which outperforms pure zero-forcing at low 
SNR's but is equivalent to ZF at asymptotically high SNR. 

Note that dirty paper coding cannot be directly applied in this scenario because of the imperfection in 
CSIT. In order to implement DPC, the transmitter in fact requires knowledge of the multi-user interference 
at the receiver, and not at the transmitter. The received interference clearly depends on the channel state, 
which is not known perfectly at the transmitter in the finite rate feedback model. The transmitter could 
estimate the received interference based on the available channel quantization, but even this estimated 
interference cannot be cancelled perfectly using DPC due to the transmitter's imperfect knowledge of the 
received signal power. In order to implement DPC, the transmitter must also know the received SNR (in 
the absence of any interference) in order to properly select the inflation factor, which is a key component 
in the dirty paper coding implementation [23]. Since the SNR also depends on the channel realization, 
the transmitter only has an imperfect estimate of the SNR and thus cannot properly select the inflation 
factor. 

When the transmitter has perfect CSI, zero-forcing can be used to completely eliminate multi-user 
interference by precoding transmission by the inverse of the channel matrix H. This creates a parallel, 
non-interfering channel to each of the M receivers, and thus leads to a multiplexing gain of M. In the finite 
rate feedback setting, the imperfection in CSIT makes it impossible to completely eliminate all multi-user 
interference, but a zero-forcing based strategy can still be quite effective. Since the transmitter only has 
knowledge of the channel quantizations but does not have any information regarding the magnitude or 
spatial direction of the quantization error, a reasonable approach to take is to select beamforming vectors 
according to the zero-forcing criterion based on the channel quantizations. 



Let hj refer to the quantized version of the mobile i's channel. These quantized vectors are compiled 

. it 

hi h2 • • • hj\/ ■ The matrix H is the estimate of the channels, upon which 



into a matrix: H 



zero-forcing is performed. Thus, the beamforming vectors are chosen to be the normalized columns of 

_p 

M 



the matrix H ^. If equal power is used for each of the data streams, the received SINR at the i-th 
mobile is given by 0: 



SINRi = J^^' , (10) 



Since the beamforming vectors are chosen orthogonal to the channel quantizations and not the actual 
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channel realizations, the interference terms in the denominator of the SINR expression are not zero. 
However, these terms directly depend on the quantization error and thus can be analyzed using the 
statistics of random vector quantization. 

We study long-term average throughput (over both the fading distribution and RVQ), and thus the rate 
of transmission to User i is equal to Ri = Sh,w [log2(l + SINRi)] if Gaussian inputs are used. By 
symmetry, the system throughput is given by: 

Rfb{P) = MEu,w [log2(l + SINR)] . 

For a system that achieves a throughput of R{P) (where P is the SNR, or power constraint), the 
multiplexing gain is defined as: 

r= lim (11) 

P-oolog2(P) 

V. Throughput Analysis 

In this section we analyze the throughput of a feedback-based zero-forcing system. We first state some 
useful preliminary calculations, and then study the achieved throughput for fixed and increasing feedback 
levels. 

A. Preliminary Calculations 

In this section we prove a few useful results regarding the distribution of terms in the SINR expression 
in ilOl . For the remainder of this paper, we use hj to denote the normalized channel vector, i.e., = 
hj/||hj||. Using this notation, we can rewrite the SINR as: 

SINRi = '' ' ' ' . (12) 

We first characterize the numerator of this expression, i.e., the received signal power: 
Lemma 2: The beamforming vector is isotropically distributed in C^^ and is independent of the 
channel direction hj as well as the channel quantization hj. 

Proof: By the zero-forcing procedure, Vj is chosen in the nuUspace of {h.j}j^i. Since RVQ is 
used and the channel directions {hjl^yi are independent isotropic vectors, the channel quantizations 
hi, . . . ,h.M are mutually independent isotropically distributed vectors. Thus the nuUspace of {h.j}j^i is 
an isotropically distributed direction in C^^, independent of either h, or hj. ■ 
Clearly the same argument holds if the transmitter performs zero-forcing on the basis of perfect CSIT 
(i.e., hi = hi). 
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Next we characterize the interference terms that appear in the denominator of the SINR expression: 
Lemma 3: The random variable |hjvj p for any i j is equal to the product of the quantization error 
sin^ (^Z{hi, hj)^ and an independent beta (1, M — 2) random variable. 

Proof: Without loss of generality, consider i = 2 and j = 1, i.e. the term |h2Vip. The vector vi is 
chosen in the nuUspace of h2, . . . , h.M, each of which is an independent isotropically distributed vector. 
Therefore, vi is isotropically distributed within the (M — 1) -dimensional nuUspace of h2. Now consider 
the normalized channel vector h2. Since RVQ is used, the quantization error has no preferential direction, 
i.e. the error is isotropically distributed in C^^. Thus, conditioned on the magnitude of the quantization 
error a = sin^ ^Z(hj,hj)^, the channel direction can be written as the sum of two vectors, one in the 
direction of the quantization, and the other isotropically distributed in the nuUspace of the quantization: 
h2 = \/l — cih.2 + \^s, where s is isotropically distributed in the nuUspace of h.2, and is independent 
of a. Therefore, the random variable h2 can be written as: 

h2 = (Vl - Z)h2 + VZS, 

where s and Z are independent, with s isotropically distributed in the nuUspace of h2 and Z distributed 
according to the quantization eiTor distribution, i.e., the minimum of 2^ beta (M — 1, 1) random variables, 
as described in Section IIII-CI 

The inner product of h2 and vi is then given by: 

|htvi|2 = (1-Z)|h5vi|2 + Z|stvi|2 

Since s and vi are iid isotropic vectors in the (M — 1) -dimensional nulUspace of h2, the quantity |sWip 
is beta (1,M — 2) distributed, and is independent of Z. ■ 
Since a beta random variable has support [0, 1], we have 

Ihjv.f < sin2 (Z(hi, hi)) , Vi ^ J, (13) 

i.e., the interference from any single user is no larger than the quantization error. When M = 2, we 
clearly have |h|vjp = sin^ ^Z(hj,hj)), and no beta random variable is needed. 

Finally, a derivation of the expectation of the logarithm of the quantization error, which is useful in a 
few subsequent theorems, is given: 

Lemma 4: The expectation of the logarithm of the quantization error is given by: 



log,(si„'(Z(h,.h.)))]=-^Ei 

k=l 
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Furthermore, this quantity can be bounded as: 



B 



M - 1 
Proof: See Appendix HlJ 



< —Eu.w 



log2 (sin^(Z(hi,hi)) 



< 



B + log2 e 
M- 1 ■ 



B. Fixed Feedback Quality 

We now analyze the average throughput achieved by the proposed zero-forcing scheme, and quantify 
the performance degradation as a function of the feedback rate. In order to study the performance loss, we 
define the rate gap AR{P) to be the difference between the per mobile throughput achieved by perfect 
CSIT-ZF and finite -rate feedback based ZF: 

1 



AR{P) 



M 



[Rzf{P) - Rfb{P)]- 



In this expression Rzf{P) refers to the throughput achieved by perfect CSIT-based zero-forcing (i.e., 
hj = hj), which is given by: 

Rzf{P) = MEnlog^ (^1 + ^|hJvzF,ip) , 

where each beamforming vector vzf,! is chosen orthogonal to {h.j}j^i. 

Theorem 1: Finite rate feedback with B feedback bits per mobile incurs a throughput loss relative to 
perfect CSIT zero forcing that can be upper bounded by: 



AR{P) < log2 [l + P -2 
Proof: The rate gap can be upper bounded as: 
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where (a) follows because ^j^i — *-* ^^'^ log(") is a monotonically increasing function. To get 

(b), note that vzF,! and Vj are each isotropically distributed unit vectors, independent of h,, by Lemma 



12 which implies En 



log2 



log2 



1 + -P ihV-P 



inequality and exploiting the independence of the channel norm (which satisfies £^[||hj|p 
channel direction, we get: 



Applying Jensen's 
M) and 
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ARiP) < log2(l + -(M-l)i^[||hi||^]£;[|hJv„ 
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log2 1 + P(M-I)i?[|hjv,f] 



By Lemma |3j the term £^[|h|vjp] is the product of the expectation of the quantization error and the 
expectation of a beta(l,M — 2) random variable, which is equal to Using Lemma ^ we have: 

AR{P) < log2 (l + P-^[sin2(Z(hi,hi))] 
< log2 (l + P- 2"^-i 



The most important feature to notice is that the rate loss is an increasing function of the system SNR 
(P), which can be explained by the linear relationship between P and the multi-user interference power. 
This intuition motivates the following result, which shows that a finite -rate feedback system with fixed 
feedback quality is interference-limited at high SNR: 

Theorem 2: The throughput achieved by finite-rate feedback-based zero-forcing with a fixed number 
of feedback bits per mobile B is bounded as the SNR is taken to infinity: 
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where in (a) we consider only one of the multi-user interference terms, and (b) uses the fact that |hjvj| < 1 
and |h|vj| < 1. Since |h|v, p is the product of the quantization error and a beta random variable (Lemma 



|3ll, we have 



log2 (Ihlvjf 



-Eu,w [log2 (sin2(Z(h„hi)))] - Ey [log^Y] 
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where we have used Lemmaj^as well as the easily verifiable fact that E[— log2 Y] = (log2 e) J2 



M-2 1 
k=l k 



< 



log2(M— 2)+log2 e when Y is beta (1, M—2). Plugging this expression into the upper bound on Rpb{P) 
gives the result. ■ 

Regardless of how many feedback bits {B) are used, the system eventually becomes interference 
limited because interference and signal power both scale linearly with P. In Fig. |3] the performance of 
a 5 antenna, 5 user system with 10, 15, and 20 feedback bits per mobile is shown. When the SNR 
is small, limited feedback performs nearly as well as zero-forcing. However, as the SNR is increased, 
the limited feedback system becomes interference limited and the rates converge to an upper limit, as 
expected. Although the upper bound in Theorem |2 is quite loose in general, it does correctly predict the 
roughly linear dependence of the limiting throughput and B. 

Notice that this interference-limited behavior can easily be avoided by reverting to a TDM A strategy, 
but this only provides a multiplexing gain of one. However, as validated by numerical results in Section 
IVI[ TDMA is preferable to feedback-based zero forcing at high SNR's if B is kept fixed. 



C. Increasing Feedback Quality 

In the previous section we studied systems with fixed feedback rates, and showed that the throughputs 
of such systems are bounded. In this section we show that this interference-limited behavior can be 
avoided by scaling the feedback rate linearly with the SNR (in dB). In fact, if the feedback rate is scaled 
at the appropriate rate, the full multiplexing gain of M is achievable. In addition to achieving the full 
multiplexing gain, it is also desirable to maintain a constant rate offset between the rates achievable with 
zero-forcing with perfect CSI and with finite -rate feedback. Note that if a bounded rate gap is maintained, 
the fuU multiplexing gain is also achieved. The following theorem specifies a sufficient scaling of feedback 
bits to maintain a bounded rate gap: 
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Fig. 3. 5x5 Channel with Fixed # Feedback Bits 



Theorem 3: In order to maintain a rate offset no larger than log2 b (per user) between zero forcing 
with perfect CSI and with finite-rate feedback, it is sufficient to scale the number of feedback bits per 
mobile according to: 

B = (M-l)log2P-(M-l)log2(&-l) (14) 
M - 1 



^ -PdB-{M -l)log^{b-l). (15) 
Proof: In order to characterize a sufficient scaling of feedback bits, we set the rate gap upper bound 
given in Theorem ^ equal to the maximum allowable gap of log2 b: 

AR{P) < log2 (^1 + P • 2"^) = log2 b. 

By inverting this expression and solving for B as a function of b and P we get: 

B = (M-l)log2P-(M-l)log2(6-l) (16) 

= '^'-;y°^-'° P.B-(Af-l)Iog.(i>-l) (17) 

M - 1 

~ PdB-(M-l)log2(6-l). (18) 

With this scaling of feedback bits, we clearly have AR{P) < log2 b for all P, as desired. ■ 
The rate offset of log2 b (per user) can easily be translated into a power offset, which is a more useful 
metric from the design perspective. Since a multiplexing gain of M is achieved with zero-forcing, the 
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Fig. 4. 5x5 Channel with Increasing # Feedback Bits 



zero-forcing curve has a slope of M bps/Hz/3 dB at asymptotically high SNR. Therefore, a rate offset 
of log2 b bps/Hz per user, or equivalently M \og2 b bps/Hz in throughput, corresponds to a power offset 
of 31og2 b dB [31] [32]. Thus, b = 2 corresponds to a 3 dB offset, and the resulting scaling of bits takes 
on a particularly simple form when a 3 dB offset is desired: 



B 



M - 1 



PdB bits/mobile. 



(19) 



In order to achieve a smaller power offset, b needs to be made appropriately smaller. For example, a 
1-dB offset corresponds to b = 10^/^'^ = 1.259 and thus an additional 1.95(M — 1) feedback bits are 
required at all SNR's. 

In Fig. 1^ throughput curves are shown for a 5 antenna, 5 user system. The feedback load is assumed 
to scale according to the relationship given in ( fT9t . and limited feedback is seen to perform within 2.6 
dB of perfect CSI zero-forcing. Notice that the actual power offset is smaller than 3 dB primarily due to 
the use of Jensen's inequality in deriving the upper bound to AR{P) in Theorem ^ The sum capacity, 
which outperforms zero-forcing by 5.55 dB in a 5 x 5 system [19], is also shown. In Fig. |5j throughput 
curves are shown for a 6 antenna, 6 user system. In this figure, B is scaled to guarantee a 3 dB and 6 
dB gap from perfect CSIT zero-forcing (i.e., b = 2 and 6 = 4 in (fT5t). Again, the actual gaps are smaller 
than the bounds (2.5 dB and 5 dB, respectively), but are still sufficiently close to make the bounds useful. 
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Fig. 5. 6x6 Channel with Increasing # Feedback Bits 



If B is scaled at a rate strictly greater than (M — 1) log2 P, i.e. B = a log2 P for any a > M — 1, 
the upper bound to the rate gap (Theorem Q is easily shown to converge to zero: 

lim AR{P) < lim loga (l + P- 2'^) = 0, 

which implies that the rate gap itself converges to zero. Thus, the throughput achieved with limited 
feedback converges (absolutely) to the perfect CSIT throughput at asymptotically high SNR. 

However, scaling Bata rate slower than (M — 1) log2 P results in a strict reduction in the multiplexing 
gain, as the following theorem shows: 

Theorem 4: If B is scaled as S = Q!log2i-* for a < M — 1, the throughput curve achieves a 
multiplexing gain of AI (^j^zj^- 

Proof: See Appendix IIVI ■ 
Intuitively, the signal power grows linearly with P, while the interference power scales as the product 
of P and the quantization error. Since the quantization error is of order P , the interference power 
scales as P^^^ m-J, which gives an SNR that scales as P^-i _ Thus, the resulting multiplexing gain is 

In Figure |6l the throughputs achieved with feedback rates given hy B = 0.5(M — l)log2-P and 
B = 1.3(M — 1) log2 P in a 4 X 4 channel are shown. When a = 0.5(M — 1), the achieved multiplexing 
gain is only 0.5M = 2, which corresponds to a slope of 2 bps/Hz per 3 dB. On the other hand, when 
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Fig. 6. Multiplexing Gain as a Function of Feedback Scaling 



a = 1.3(M — 1), the full multiplexing gain is achieved and the rate gap relative to perfect CSIT zero- 
forcing converges to zero at high SNR. 

D. RVQ vs. Optimal Vector Quantization 

Given the previous sections results, an important issue to consider is the possible performance loss 
incurred by use of random vector quantization instead of an optimal vector quantizer. In this section we 
derive an upper bound to the bit savings that can result from using optimum vector quantization, and 
find that the RVQ penalty is actually very small. This is in accordance with results on the optimaUty of 
RVQ for point-to-point MIMO and MISO systems [11]. 

In order to determine the sub-optimality of RVQ, we utilize a lower bound to the quantization error 
of any vector quantization codebook developed in [5], [33]: 

Lemma 5: Consider an arbitrary B-\Ai quantization codebook {wi, . . . , W2s}, and let the random 
variable Y denote the quantization error Y = sin^(Z(hj, hj)). The random variable Y stochastically 
dominates the random variable Z, whose CDF is given by: 

Fziz) = { ~ ~s (20) 

1, z> 

or F^iz) > Fy{z) for all < z < 1. 
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This lemma applies to the quantization error for a fixed quantization codebook, where the randomness 
is due only to the isotropic distribution of the channel vector. If RVQ is used, this result can be applied to 
each quantization codebook, and therefore still holds when considering the random variable describing the 
quantization error, where there is randomness over both the channel realizations as well as the codebooks: 

Corollary 1: If the random variable Z = sin^(Z(hj, hj)) denotes the quantization error achieved when 
using RVQ (as described in Section Illl-Ct . then Z stochastically dominates Z, whose CDF is given in 
(EOj. 

In order to determine the difference between RVQ and and a possibly optimal quantizer, we compute 
the feedback scaling required to maintain a bounded rate offset as in Theorem |3j but assuming the 
quantization error is described by Z instead of Z. In order to compute this scaling, we first must compute 
the expectation of Z. A simple calculation yields: 

E[Z] = j\l-F^{z))dz={^^y-^^. 

Notice that this differs from the upper bound to E[Z] (Lemma only in the term and thus the 

interference power is reduced by at most a factor of In order to solve for the required feedback 

with this bound on the quantization error we set: 

M- 1 



logs ( 1 + P • ^^^2-— ) A log^ 5^ 



and solve for B, which gives: 



B = (M-l)log2P-(M-l)log2(6-l)-(M-l)log2(^^^). (21) 



Comparing this with the similar term for RVQ in ^tim . we see that the bit savings is a constant factor 
of (M — 1) log2 jvT^) ^^^^ SNR's. Furthermore, using the fact that loge(l + x) < x for x > 

we have (M - 1) log^ [j^^ = (M - 1) log^ {l + < (M - = 1- Converting to base 2, 

we see that (M — 1) log2 ^ jj^-^ ^ < log2 e bits, or approximately 1.44 bits. Thus, using RVQ leads to at 
most a 1 .44 bit penalty relative to optimum vector quantization, which is quite small relative to the total 
feedback load. We should note that this is somewhat of an approximation because we have used Jensen's 
inequality to derive the feedback load for both RVQ and for the lower bound. However, numerical results 
validate the accuracy of Theorem |3j and thus of this metric. 

An alternative method to measure RVQ against optimum vector quantization is to compare the perfor- 
mance for the same number of feedback bits, as opposed to the above analysis in which we compared 
the required feedback load required for identical performance. As stated earlier, the quantization error. 
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and thus the interference, is a factor of 



M-l 
M 



smaller in the lower bound. If the feedback is scaled in 



order to maintain a 3 dB gap from perfect CSI zero forcing, the noise and interference term are kept 
bounded by two (which corresponds to 3 dB). The lower bound, on the other hand, would be 1 + ^'^^"^ 
instead of 2. For M = 5, this corresponds to 2.55 dB, or a 0.45 dB advantage relative to RVQ. As the 
number of transmit antennas M increases, this gap clearly goes to zero. 

E. Regularized Zero-Forcing 

Although zero-forcing precoding performs quite well at moderate and high SNR's, regularization can 
significantly increase throughput at low SNR's [30]. In fact, this is exactly analogous to the difference 
between zero-forcing equalization and MMSE equalization: while zero-forcing results in complete can- 
cellation of (inter-symbol) interference, an MMSE equalizer alternatively allows a measured amount of 
interference into the filtered output such that the output SNR is maximized. Regularized zero-forcing is 
also implemented through a linear precoder, but with a slightly different selection of beamforming vectors. 
If H denotes the concatenation of quantization vectors available to the access point, the zero-forcing 
beamforming vectors are chosen as the normalized columns of H~^, or equivalently of H^(HH^)~^. 
With regularized zero-forcing, the beamforming vectors vi , . . . , -vk are chosen to be the normalized 
columns of the matrix: 



The use of the regularization constant ^ is well motivated by results in [30] as well as the optimal MMSE 
filters on the dual multiple access channel [16]. It is clear from this regularization that the regularized 
beamforming vectors will converge to standard zero-forcing vectors at asymptotically high SNR. 

Since the rates achieved by zero-forcing and regularized zero-forcing converge at asymptotically high 
SNR, the feedback scaling specified in Theorem |3l gives the desired rate/power offset for regualarized 
ZF at asymptotically high SNR. Although we have not been able to extend Theorems ^ or |51 to the rate 
offset between regularized ZF based on perfect versus feedback-based CSIT, numerical results indicate 
that these results actually hold at all SNR's, and not just at very high SNR values. In fact, numerical results 
indicate that Theorem 13 more accurately predicts the rate offset for regularized ZF than for standard ZF 
at low and moderate SNR values. 

Throughput curves for zero-forcing (solid lines) and regularized zero-forcing (dotted lines) with perfect 
CSIT and feedback-based CSIT (with B = ^^^Pds) are shown for a 5 x 5 channel in Figure The 
throughput achieved with regularized ZF with perfect CSIT is significantly larger (by approximately 4 
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Fig. 7. 5x5 Channel with Regularized Zero-Forcing 

bps/Hz) than the throughput with perfect CSIT based ZF for SNR's between and 15 dB, but the two 
curves converge at very high SNR. The same is true for the throughput achieved with finite rate feedback 
and regularized ZF vs. standard ZF. Furthermore, while the rate offset between perfect CSIT-ZF and 
feedback-CSIT ZF increases from nearly zero at low SNR to its limiting value at high SNR, the rate 
offset between the two regularized ZF curves is relatively constant over the entire SNR range, as noted 
earlier. 

VI. Performance Comparison 

In this section we present numerical results comparing the throughput achieved with finite rate feedback 
and two alternative transmission techniques for the MIMO downlink, random beamforming and TDMA. 
Because regularized ZF outperforms ZF at all SNR's, we only consider regularized ZF systems. 

Random beamforming is an extension of opportunistic beamforming [34] to the multiple antenna 
downlink [7]. The transmitter randomly chooses M orthogonal beamforming vectors and transmits pilot 
symbols along these vectors. Each mobile measures the SINR of each beamforming vector, and feeds 
back the index of the vector with the highest SINR (requiring log2 M bits), along with the corresponding 
SINR. The access point then transmits to the best user on each of the beamforming vectors. The required 
feedback per mobile is quite small (log2 M bits plus an analog SNR value, which will presumably be 
sufficiently quantized), but this scheme does not perform well in systems with a moderate number of 
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Fig. 8. 4x4 Channel with Scaled Feedback, TDMA, & Random BF 

mobiles (i.e., K M). In addition, random beamforming is interference limited at asymptotically large 
SNR if the number of mobiles is kept fixed. 

TDMA, in which the access point serves a single user at a time, is perhaps the simplest downlink 
transmission scheme. We consider the TDMA throughput achievable with perfect CSIT, in which the 
access point transmits (using the capacity-achieving beamforming strategy) to only the user with the 
largest SNR. While it is possible to incorporate the effect of finite-rate feedback into a TDMA system 
(as described in Section IIII-DI) . the effect is relatively negligible at the feedback levels considered here 
and thus for simplicity we consider perfect CSIT. Since a TDMA system achieves a multiplexing gain 
of only one, we expect to see a significant throughput degradation if TDMA is used, particularly at high 
SNR. However, note that the difference between the sum capacity of the MIMO downlink (achieved by 
DPC) and the achievable TDMA throughput is not particularly large at SNR's less than 5 or 10 dB [2]. 

Figure [S] plots achievable throughput for a finite -rate feedback system with B scaled to maintain a 3 
dB offset (equation (fT9l). TDMA, and random beamforming, in a 4 x 4 channel. Finite-rate feedback 
outperforms random BF beyond 5 dB, which is not surprising given that the feedback level per mobile 
(which is given hy B = P^b for this particular channel) is significantly higher than for random BF. 
Finite -rate feedback and TDMA give approximately the same throughput up to 10 dB, after which the 
feedback system begins to significantly outperform TDMA, due to the superior multiplexing gain of 
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Fig. 9. 4x4 Channel with Fixed Feedback, TDMA, & Random BF 

the zero-forcing system. The same channel is considered in Figure |9l but with fixed feedback levels 
(B = 5, 10, 15, 20) in the finite -rate feedback system^. Here we see that TDMA is a better choice than 
finite-rate feedback with either 5 or 10 bits of feedback per mobile. If 15 or 20 feedback bits are permitted, 
however, finite-rate feedback can provide a significant advantage over TDMA, particularly above 10 dB. 

Figure [TOl displays achievable throughput in a 8 x 8 channel for finite-rate feedback systems with B 
scaled according to equation (fWt and with B = 20, along with TDMA and random BF throughputs. The 
throughput achieved with scaled feedback is considerably larger than the TDMA throughput, due to the 
large number of spatial degrees of freedom used by the zero-forcing system. The 20 feedback bit system 
outperforms TDMA at moderate SNR's, but hits the interference limited regime around 15 or 20 dB. 

In general, finite-rate feedback based zero-forcing outperforms TDMA at SNR's above 5 or 10 dB if 
the feedback per mobile is sufficiently large, but does not provide a significant advantage at low SNR's. 
Furthermore, random beamforming is outperformed by both TDMA and finite -rate feedback at essentially 
all SNR levels. However, this is largely due to the limited number of mobiles, as random beamforming 
does provide excellent performance in systems with many more mobiles than AP antennas. 

^Note that the finite-rate feedback throughput decreases with SNR in some cases due to the decreasing regularization factor 
as a function of the SNR. A more careful tuning of this parameter can prevent this behavior, but does not significantly increase 
throughput. 
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Fig. 10. 8x8 Channel with Feedback, TDMA, & Random BF 

VII. Conclusions 

The use of multi-user MIMO techniques can significantly increase downlink throughput without re- 
quiring large numbers of antennas at each mobile device. However, it is crucial that the transmitter have 
accurate channel state information in order to realize these gains. In fact, the availability of channel 
state information appears to be the critical issue that will determine the feasibility of multi-user MIMO 
techniques in future wireless communication systems. 

We investigated the finite-rate feedback model, in which each mobile quantizes its channel reaUzation 
to a finite number of bits that are fed back to the transmitter. Although this model has been extensively 
studied for point to point MIMO channels, conclusions are quite different in the multi-user setting. Our 
primary result showed that the number of feedback bits per mobile must be increased linearly with the 
SNR (in dB) in order to realize multi-user MIMO benefits. As a result of this, feedback levels are quite 
high in MIMO downlink channels: in a 4 antenna system, for example, each mobile must feed back 10 
bits at an overall system SNR of 10 dB. In contrast, feedback does not need to scale with SNR in point 
to point MIMO systems, and even a relatively small number of bits (e.g., 4 or 5) can be very beneficial at 
all SNR levels. This scaling relationship is particularly troublesome because multi-user MIMO techniques 
are generally most beneficial in the high SNR regime. 

The intuition for the extreme sensitivity of the MIMO downlink channel to imperfection in CSIT, as 
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compared to point-to-point MIMO channels, is actually quite straightforward. In point-to-point MIMO 
channels, imperfect CSIT leads to mismatch between the input and the transmission modes of the channel 
and thus some "wasted" transmission power (e.g., power that is transmitted into the nuUspace of the 
chaimel), which reduces the received SNR but has no other deleterious effect. In a MIMO downUnk 
chaimel, imperfect CSIT also leads to mismatch between the input and the chaimel, but the effect of 
this is increased multi-user interference, which is significantly more harmful than a reduction in desired 
signal power. 

There are, however, a number of reasons to be optimistic regarding the issue of CSI feedback in 
downlink channels. First note that we have considered only the most basic iid Rayleigh block fading 
model, and feedback rates can surely be decreased by exploiting spatial and temporal correlation inherent 
in any physical fading process, as has already been extensively studied for point to point MIMO channels 
[35] [36] [37] [38]. Furthermore, recent work has shown that a small number of mobile anteimas can be 
used to significantly improve quantization quality and thereby decrease the required feedback rates [39]. 
In addition, it may also be possible to exploit multi-user diversity effects and thereby reduce feedback 
rates in systems with a large number of mobiles [7] [13] [9]. Within this body of work, one interesting 
idea is to have only a subset of mobiles (e.g., mobiles meeting a fixed SNR threshold) feed back their 
chaimel information [40] [41] [9]. While this technique can significantly reduce the total number of 
feedback bits to be transmitted, it also requires contention-based feedback (i.e., random access), which 
can lead to throughput reduction as well as additional latency on the feedback channels. Although we 
have considered only digital feedback methods, analog feedback appears to have a number of attractive 
properties for the downlink channel [42] [43] [44]. A careful comparison of analog and digital techniques 
remains to be performed. 

We close by mentioning a few important issues not considered in this work. One key assumption made 
in this work is that the channel feedback is instantaneous. Of course, there will be some non-zero delay 
associated with transmitting feed back bits from the mobiles to the access point, and this delay can be 
quite significant in fast fading (large Doppler spread) channels. In fact, numerical results in [44] indicate 
that feedback delay can severely limit the performance of certain downlink transmission schemes at even 
moderate levels of Doppler. In addition, each mobile is assumed to have perfect CSI, while there will be 
non-zero receiver estimation error in any practical system. This will clearly lead to additional imperfection 
in the CSI provided to the access point, and could have significant effects. Frequency-selective channels 
also require attention; consider related work on frequency-selective point-to-point MIMO channels [45] 
[46]. Finally, note that we have only considered frequency division duplexed systems. Chaimel reciprocity 
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can be exploited to acquire downlink channel state information from the uplink in time division duplexed 
(TDD) channels, although recent results indicate that, somewhat counter-intuitively, TDD may in fact be 
less attractive than FDD from a channel state information perspective [42]. Many of the tools used here 
appear to be well suited to analyze the effect of imperfect CSIT in TDD systems as well. 
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Appendix I 
Expected Quantization Error 

In this appendix we provide an alternative proof for the closed-form representation of the expected 
quantization error. The following integral representation for the beta function is given in [47, pg. 5]: 




a > 0, b> 0, c> 0. 



With a = 1, 6 = M - 1, and c = 2^ -M this yields: 
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Using the fact that Pr (sin^ (Z(hj,hj)) > z) = {1 





— z^ ^)^^, we have: 
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where we have used the fundamental equahty T{z + 1) = zT{z). 
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Appendix II 
Proof of Lemma [T] 

The expected quantization error is given by [26]: 
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For M > 2: 
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The above inequality is reached because r(x) < 1 for 1 < x < 2, due to the convexity of the gamma 
function [27] and the fact that r(l) = r(2) = 1. By applying Kershaw's inequality for the gamma 
function [48]: 
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with X = 2^ + and s = 1 — tjjti we get 
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Furthermore, the decreasing nature of the function (•) gives a further upper bound of 2 
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Appendix III 
Proof of Lemma m 

Let Z = sin^(Z(hj, hj)) represent the quantization error. As stated in Section ITlI-CI Z is the minimum 
of 2^ beta (M - 1,1) random variables with CCDF given by: Pr(Z > z) = {1 - z*^"^)-^, where 
L = 2^. We wish to compute E[log^Z], or equivalently E[—log^Z]. Since < Z < 1, the random 
variable — logg Z is non-negative with support [0, oo). Using the fact that E[X] = Fr{X > x)dx for 
non-negative random variables and the binomial expansion, we have: 

^[-log.Z] = / Vv{-\og^Z>z)dz 
Jo 

/•oo 

= / Pr(Z < e-'')dz 
Jo 

l-OO 

= / 1 - (1 - e-^(^'^-i))^(iz 

= /"l-^('^V-l)^e--(^^-i)^dz 
k=o ^ ^ 

= jJ|"X;(^)(-l)^+ie-^(^^-i)'=dz 



k=l ^ 



1 ^ 

M - 1 ^ 

k=l 



L\ (-1)' 



M - 1 ^ fc' 



k=l 

where the final line follows from [49, Section 0.155]. 



Furthermore, since logg a = ^dx, we have logg L < X^^^^ ^ < logg L + 1. We multiply by Iog2 e 
to translate to base 2, and thus get E[— log2 Z] = ^k=i i- Using L = 2^ we get the subsequent 
bounds. 
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Appendix IV 
Proof of TheoremIH 

The multiplexing gain can be expanded as: 

MEii^w[RFB{P)] 



m 



lim 

P^oo 



MEnw 



lim ■ 

P^oo 



log2(l + ^|hlv,|2 + ^^._,.|hTv,- 



t,,.|2 



MEu. 



w 



- lim ■ 

P^oo 



MR 



M - lim ■ 

P^oo 



The final step follows because 
P 



logo 1 + 



M 



hjv.p) <log2 ||l + ^|hlv.|2 + ^5^|hlv,f j <log2(l + P||h.||2), 



and the multiplexing gain of the upper and lower bounds are easily shown to be one. 
We first show that m < jf^ (using L = 2^ = P°^): 



P 



4v/ 



> Eu,w 



=2 



logo ( ^Ihlv-I^ 



legal hi v,f 



log2 + En [log2 ||hi||2] + Eh,w 

log^P + En,w [log2 (sin2(Z(h„hi)))] +0(1) 



(23) 



k=l 



M 



> logaP 



1 



M -1 



log2L + 0(l) 



M - 1 



log2P + 0(l) 



where we have used Lemma |3 to evaluate the expectation of the logarithm of the interference term. 
Plugging this bound in to (l23l gives m < M 



M-l 



To show m > M (^j^^^, we upper bound the limit in (I23t using the fact that |h|vjp < sin^(hj, hj) 
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for any i ^ j (from (IT3l ) and Jensen's inequality: 



w 



log2 



log2 ( 1 + — (M-l)||hi||2sin2(hi,h, 



< log2 (l + P{M - l)E [sin2(h,, 

< log2 (l + P(M- 1)2"^) 
= log2 (^1 + (M- 



Thus we have 

lim 



|2 



< M 1 



P-.00 log2 -P ~ \ M - I 

which gives m > M (^jj^^- 
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